Gender-Based Vocation Identification in Swedish 19th Century Prose Fiction using Linguistic Patterns, NER and CRF Learning
نویسندگان
چکیده
This paper investigates how literature could be used as a means to expand our understanding of history. By applying macroanalytic techniques we are aiming to investigate how women enter literature and particularly which functions do they assume, their working patterns and if we can spot differences in how often male and female characters are mentioned with various types of occupational titles (vocation) in Swedish literary texts. Modern historiography, and especially feminist and women’s history has emphasized a relative invisibility of women’s work and women workers. The reasons to this are manifold, and the extent, the margin of error in terms of women’s work activities is of course hard to assess. Therefore, vocation identification can be used as an indicator for such exploration and we present a hybrid system for automatic annotation of vocational signals in 19th century Swedish prose fiction. Beside vocations, the system also assigns gender (male, female or unknown) to the vocation words, a prerequisite for the goals of the study and future in-depth explorations of the corpora.
منابع مشابه
Character Profiling in 19th Century Fiction
This paper describes the way in which personal relationships between main characters in 19 century Swedish prose fiction can be identified using information guided by named entities, provided by a entity recognition system adapted to the 19 century Swedish language characteristics. Interpersonal relation extraction is based on the context between two relevant, identified person entities. The re...
متن کاملComments on Nonfinite Adverbial Patterns in English Prose Fiction: A Simple Model for Analysis and Use
This study aims to present an accessible model of some frequent nonfinite adverbial types occurring in English prose fiction. As its main syntactic argument, it recognizes that these adverbials are mostly elliptical in that there are some dependent-clause markers one can assume to be implicit when supplying those elements back into the clause complex. Some comments are provided at the end on th...
متن کاملAn Applied Linguistics Look at the Linguistic Comparison of Nominal Group Complexity between Two Samples of a Genre
The roles and effects of changes in syntax on comprehension and processing effort, and the relationships between these two, comprise a large and separate field of inquiry, with the general belief now in place that such changes and variations bring about varied psycholinguistic and discursive implications for comprehension, manifesting themselves differently in different genres.The current study...
متن کاملNaming the Past: Named Entity and Animacy Recognition in 19th Century Swedish Literature
This paper provides a description and evaluation of a generic named-entity recognition (NER) system for Swedish applied to electronic versions of Swedish literary classics from the 19th century. We discuss the challenges posed by these texts and the necessary adaptations introduced into the NER system in order to achieve accurate results, useful both for metadata generation, but also for the en...
متن کاملThe position of Persian language and literature in Ottoman’s 19th century literature and historical developments
With the spread of western reforms in the 13th/9th century, Ottoman’s literature was reformed either. To reform Ottoman literature, they decided to transform the Ottoman language and literature relations with Persian language and literature. On one hand, they considered problems of Ottoman literature regarding Pindaric and its inefficiency for entering new areas such as novel, drama, and journa...
متن کامل